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We present a measurement of the cross section for W boson production in association with 
at least one &-quark jet in proton-antiproton coUisions. The measurement is made using data 
corresponding to an integrated luminosity of 6.1 fb^^ recorded with the DO detector at the 
Fermilab Tevatron pp Collider at ^/s = 1.96 TeV. We measure an inclusive cross section 
of a{W (-> fiu) + b + X)^ 1.04 ± 0.05 (stat.) ± 0.12 (syst.) pb and a{W{-^ eu) + b + X) = 
1.00 ± 0.04 (stat.) ± 0.12 (syst.) pb in the phase space defined by p^ > 25 GeV, p^^''* > 20 GeV, 
|^b-jet| ^ ^^^^ andamuon (electron) withp^ > 20 GeV and I?]"! < 1.7 (Ir)"] < 1.1 or 1.5 < < 2.5). 
The combined result per lepton family is a{W{~^ Iv) + & + X) — 1.05 ± 0.12 (stat.+syst.) for 
\ri^\ < 1.7. The results are in agreement with predictions from next-to-leading order QCD calcula- 
tions using MCFM, a{W + b) ■ B{W — >■ Iv) = 1.34 ^0^34 (syst.), and also with predictions from the 
SHERPA and MADGRAPH Monte Carlo event generators. 

PACS numbers: 12.38.Qk, 13.85.Qk, 14.65.Fy, 14.70.Fm 



The measurement of the production cross section of 
a W boson in association with a 6-quark jet provides a 
stringent test of quantum chromodynamics (QCD). Pro- 
cesses involving W/Z bosons in association with b quarks 
are also the largest backgrounds in studies of the stan- 
dard model (SM) Higgs boson decaying to two b quarks, 
in measurements of top quark properties in both sin- 
gle and pair production, and in numerous searches for 
physics beyond the SM. The cross section for the pro- 
cess pp ^ W + b + X has been calculated with next- 
to-leading order (NLO) precision [l|, Q. Subprocesses 
at NLO include qq Wbb, qq Wbbg, and 

qg — > Wbbq' . An additional small contribution comes 
from sea b quarks in the incoming proton or antiproton. 
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bq Wbq'. 

In this letter we describe a measurement of the cross 
section for W boson production in association with b- 
quark jets in pp interactions, where a W boson is identi- 
fied via its electronic or muonic decay modes. A measure- 
ment of W+b production cross section with up to two jets 
at = 1.96 TeV has been published by the CDF Collab- 
oration 0] and an inclusive measurement has been pub- 
lished by the ATLAS Collaboration ^ at ^/s = 7 TeV. 
The measured production cross section reported by CDF 
is tr • S(W^ -> ^i/) = 2.74 ± 0.27 (stat.) ± 0.42 (syst.) pb 
(£ = e, I/), while the theoretical expectation 
for this quantity based on NLO calculations is 
1.22 ± 0.14 (syst.) pb With the CDF measurement of 
W + b production exceeding significantly the NLO pre- 
diction, while the ATLAS result is in agreement with the 
expectation, an independent measurement is important 
to understand the production of W bosons in association 
with b jets at hadron colliders. 

The data used in this analysis were collected between 
July 2006 and December 2010 using the DO detector at 
the Fermilab Tevatron Collider at ^/s = 1.96 TeV, 
and correspond to an integrated luminosity of 6.1 fb~^. 
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We first briefly describe the main components of the DO 
Run II detector [Hj relevant to this analysis. The DO 
detector has a central tracking system consisting of a 
silicon microstrip tracker (SMT) [Gl and a central fiber 
tracker (CFT), both located within a 2 T supercon- 
ducting solenoidal magnet, with designs optimized for 
tracking and vertexing at pseudorapidities \r]\ < 3 and 
|?7| < 2.5, respectively 0]. A liquid argon and uranium 
calorimeter has a central section (CC) covering pseudo- 
rapidities I77I < 1.1, and two end calorimeters (EC) that 
extend coverage to « 4.2, with all three housed in 
separate cryostats An outer muon system, at \ri\ < 2, 
consists of a layer of tracking detectors and scintillation 
trigger counters in front of 1.8 T toroids, followed by two 
similar layers after the toroids. Luminosity is measured 
using plastic scintillator arrays located in front of the EC 
cryostats. The trigger and data acquisition systems are 
designed to accommodate the high instantaneous lumi- 
nosities of Run II. 

The W + b candidates are selected by triggering on sin- 
gle lepton or lepton-plus-jet signatures with a three-level 
trigger system. The trigger efficiencies are approximately 
70% for the muon channel and 95% for the electron chan- 
nel. 

W boson candidates are identified in the fi+v and e+v 
decay channels whereas a small fraction of selected events 
arises from leptonical decaying tau leptons. Offline event 
selection requires a reconstructed primary pp interaction 
primary vertex (PV) that has at least three associated 
tracks and is located within 60 cm of the center of the 
detector along the beam direction. The vertex selection 
for W + b events is about 97% efficient as measured in 
simulations. 

Electrons are identified using calorimeter and tracking 
information. The selection requires exactly one electron 
with transverse momentum > 20 GeV identified by an 
electromagnetic (EM) shower in the central {\r]'^\ < 1.1) 
or endcap (1.5 < \'r]'^\ < 2.5) calorimeter by comparing 
the longitudinal and transverse shower profiles to those 
of simulated electrons. The showers must be spatially 
isolated from other energetic particles, deposit most of 
their energy in the EM part of the calorimeter, and pass 
a likelihood criterion that includes a spatial track match. 
In the central detector region, an E/p requirement is ap- 
plied, where E is the energy of the calorimeter cluster and 
p is the momentum of the track. The transverse momen- 
tum measurement of electrons is based on calorimeter 
energy information. 

The muon selection requires the candidate to be recon- 
structed from hits in the muon system and matched to a 
reconstructed track in the central tracker. The transverse 
momentum of the muon must exceed > 20 GeV, with 
\ri^\ < 1.7. Muons are required to be spatially isolated 
from other energetic particles using information from the 
central tracking detectors and calorimeter Muons 
from cosmic rays are rejected by applying a timing crite- 



rion on the hits in the scintillator layers and by applying 
restrictions on the displacement of the muon track with 
respect to the selected PV. 

Candidate W + jets events are then selected by re- 
quiring at least one reconstructed jet with jr^j*^*! < 1.1 
and pJ^* > 20 GeV. Jets are reconstructed from energy 
deposits in the calorimeter using the iterative midpoint 
cone algorithm [l3l and a cone of radius AR = 0.5 in y-(p 
space [7|]. The energies of jets are corrected for detector 
response, the presence of noise and multiple pp interac- 
tions, and for energy deposited outside of the jet recon- 
struction cone. To enrich the sample with W bosons, 
events are required to have missing transverse energy 
]^ rp> 25 GeV due to the neutrino escaping detection. 

Background processes for this analysis are electroweak 
M^-l-jcts/7 production, production, tt and single 

top quark production, diboson production, and multijet 
events with jets misidcntificd as leptons. The W+b signal 
and SM background processes are simulated using a com- 
bination of PYTHiA V6.409 [III and alpgen v2.3 [12 | 
with PYTHIA providing parton showering and hadroniza- 
tion. We use PYTHIA Tune A with CTEQ6L1 [l3| par- 
ton distribution functions (PDFs) and perform a detailed 
GEANT-based [l3| simulation of the DO detector. The 
F-l-jets {V = W/Z) processes are normahzed to the 
inclusive W and Z'-boson cross sections calculated at 
NNLO [15| . The Z-boson pT distribution is modeled to 
match the distribution observed in data [l6| , taking into 
account the dependence on the number of reconstructed 
jets. To reproduce the VF-boson pT distribution in sim- 
ulated events, the product of the measured Z-boson pT 
spectrum and the ratio of W to Z'-boson px distribu- 
tions at NLO is used as correction. NLO-I-NNLL (next- 
to-next-to-leading log) calculations are used to normalize 
tt production [17|, while single top quark production is 
normalized to NNLO The NLO WW, WZ, and 

ZZ production cross section values are obtained with 
MCFM program [l^. For the VF-|-heavy-flavor jet (b or c 
quark) events, the ratio of the ALPGEN prediction to the 
NLO prediction for W + bb and W + cc is obtained from 
MCFM [1^ and applied as a correction factor. The simula- 
tion is also corrected for the trigger efficiencies measured 
in data. 

Instrumental backgrounds and those from semileptonic 
decays of hadrons, referred to as "multijet" background, 
arc estimated from data. The instrumental background 
is important for the electron channel, where a jet with 
a high electromagnetic fraction can pass electron identi- 
fication criteria, or a photon can be misidcntificd as an 
electron. In the muon channel, the multijet background 
is less significant and arises mainly from the semilep- 
tonic decay of heavy quarks in which the muon satisfies 
the isolation requirements. We require that the W bo- 
son candidates have a transverse mass Mt [IBl satisfying 
40 GcV + J, < Mt < 120 GeV to suppress multijet 
background and mis-reconstructed events. The average 
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efficiency determined in simulation for a W + b signal to 
pass these requirements is about 82%. 

Identification of b jets is crucial for this measurement. 
Once the inclusive PF+jets sample is defined, the jets con- 
sidered for b tagging are subject to a requirement called 
taggability. This requirement is imposed to decouple the 
performance of the 6-jet identification from detector ef- 
fects. For a jet to be taggable, it must contain at least 
two tracks with at least one hit in the SMT, pr > 1 GeV 
for the highest-pT track and pr > 0.5 GeV for the next- 
to-highest pt track. The efficiency for a jet to be taggable 
is about 90% in the selected phase space. 

The DO 6-tagging algorithm for identifying heavy fla- 
vor jets is based on a combination of variables sensitive 
to the presence secondary vertices (SV) or tracks dis- 
placed from the PV. This analysis uses an updated b 
tagger utilizing a multivariate analysis (MVA) [2l|, 22 1 
that provides improved performance over the previous 
neural network based algorithm [2^ . The most sensitive 
input variables to the MVA are the number of recon- 
structed secondary vertices in the jet, the invariant mass 
of charged particles associated with the SV (Mgy), the 
number of tracks used to reconstruct the SV, the two- 
dimensional decay length significance of the SV in the 
plane transverse to the beam, a weighted combination 
of the tracks' transverse impact parameter significances, 
and the probability that the tracks from the jet origi- 
nate from the PV, which is referred to as the jet lifetime 
probability (JLIP). The MVA provides a continuous out- 
put value that tends towards one for b jets and zero for 
non-& jets. Events are considered in which at least one 
jet passes a tight MVA requirement corresponding to an 
efficiency of « 50% for b jets. The likelihood for a light 
jet (m, d, s quarks and gluons) to be misidentified for 
the corresponding MVA selection is about 0.5%. Simu- 
lated events are corrected to have the same efficiencies for 
taggability and ^-tagging requirements as found in data. 
These corrections are derived in a flavor dependent man- 
ner [23| , using independent QCD enriched data samples 
and simulated events with enriched light and heavy jet 
contributions. Jets containing b quarks have a different 
energy response and receive an additional energy correc- 
tion of about 6% as determined from simulation. Figure[T] 
shows the transverse mass of the candidate events before 
and after applying 6-jet identification. 

In addition to the MVA output, we perform further 
selections using Msv and JLIP variables. Afgy pro- 
vides good discrimination between 6, c , and light quark 
jets due to their different masses [2^. The two vari- 
ables together take into account the kinematics of the 
event and, in order to further improve the separation 
power, they are combined in a single variable X'mjl = 
\ (Msv/(5 GeV) - ln(JLIP)/20) loose criterion 

for an event to pass at least 2?mjl > 0.1 is applied to 
remove poorly reconstructed events. The efficiency for 
signal events to pass this selection is about 97%. 



The numbers of expected and observed events before 
and after applying the &-jet identification in data and 
simulation are listed in Table ID The 6-tagging column 
includes the selection requirement on I?mjl- 



Process 


No 6-tag 


6-tag 


V^+heavy flavor 


41093 ± 8924 


5068 ± 1124 


l/+light flavor 


516661 ± 56734 


5718 ± 678 


Diboson 


4728 ± 519 


222 ± 26 


Top 


5431 ± 536 


1602 ± 181 


Multijet 


20527 ± 4458 


794 ± 180 


Expected events 


588440 ± 57610 


13405 ± 1338 


Data 


586289 


12793 



TABLE I: Numbers of events for data and contributing pro- 
cesses before and after applying 6-jet identification. Uncer- 
tainties include statistical and systematic contributions. The 
contribution of Z -I- jets events to the V-l-jets samples is ~ 5% 
for heavy and light flavor jets before and after 6-tagging. 
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FIG. 1: [color online] Transverse mass of the Iv system (a) be- 
fore and (b) after 6-jet identification. The data are shown by black 
markers, simulated background processes are shown by filled his- 
tograms. The data uncertainties are statistical only. An estimate of 
the systematic uncertainty on the simulated background processes 
is shown by the shaded bands 
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We measure the fraction of W+b+X events in the final 
selected sample by performing a binned maximum like- 
lihood fit to the observed data distribution of the 2?mjl 
discriminant in our sample shown in Fig. [2l The tem- 
plates for -I- light flavor, W + b, and M^-|-c jets shown in 
Fig. [5] are taken from the efficiency-corrected simulation. 
Expected contributions from Z-l-jets, single top quark, tt, 
diboson, and multijet production are subtracted from the 
data. After performing the fits, we obtain the number of 
events with different jet flavors listed in Table HH 

The measured cross sections arc presented at the parti- 
cle level by correcting for detector acceptance, selection- 
efficiencies, and 6-jct identification. Wc quote our result 
as a cross section in a restricted phase space: at least 
one 6-jct with pp"^ > 20 GeV, |77^"j'=*| < 1.1 and a muon 
with > 20 GeV and jj/'j < 1.7 or an electron with 
> 20 GeV and jr/'^l < 1.1 or 1.5 < Irj^'l < 2.5. For the 
neutrino momentum we require pi^ > 25 GeV. 





W ^ fiu 


W ^ eif 


Process 


Events Fraction 


Events Fraction 


W + b 


1306 ± 166 0.3 ± 0.04 


1676 ± 212 0.27 ±0.03 


W + c 


664 ±97 0.1 ±0.02 


1096 ± 159 0.18 ±0.03 


W + l.f. 


2152 ± 265 0.5 ±0.07 


3479 ± 425 0.56 ± 0.07 


Data-Bkgd 


4127 ± 150 


6255 ± 168 



TABLE II: Estimated numbers of VF±jet events from fitting 
the flavor-specific processes, along with the expected back- 
ground of W boson processes and the data after subtracting 
Z±jets, single top quark, tt, and diboson background pro- 
cesses, l.f. stands for light flavor jets. Uncertainties include 
statistical and systematic contributions. 



Systematic uncertainties are determined by varying ex- 
perimental parameters and efficiency/acceptance correc- 
tions by one standard deviation and propagating the ef- 
fect on I?MJL- The systematic uncertainties are domi- 
nated by effects related to the measurement of jets. The 
contributions from jet energy resolution, jet modeling, 
and detector effects are about 2.5%, 3%, and 4%, re- 
spectively. Uncertainties on 6-jet identification are de- 
termined in data and simulations by using 6-jet-enriched 
samples and are about 2%— 5% per jet. The uncertainties 
due to lepton identification are about 2%. The integrated 
luminosity is known to a precision of 6.1% [25|. The un- 
certainty of the template fit is estimated by varying the 
normalization and shape from the data corrections of the 
W boson processes and the fit parameters (about 6%). 
By summing the uncertainties in quadrature we obtain 
a final total systematic uncertainty on the cross section 
measurements of approximately 12%. 

The cross section times branching fraction is calculated 
by dividing the number of signal events measured by in- 
tegrated luminosity (£), acceptance (A), and efficiencies 
(e) of the selection requirements: 
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FIG. 2: [color online] Contributions of the various jet flavors nor- 
malized to the measured cross section obtained from a fit in the 
W — > iiu channel on both (a) linear and (b) logarithmic scales. 
The various W + jets processes are shown as filled histograms and 
data, after the subtraction of contributions from DrcU-Yan, dibo- 
son, and top quark production, arc represented v^ith black markers. 
The uncertainties include both statistical and systematic contribu- 
tions. 



a{W + b) ■ B{W £v) 



w+b 



(1) 



where e is given by the product of the trigger, object 
reconstruction, and selection efficiencies. 

We first present results separately for the muon chan- 
nel and electron channel because they are performed in 
slightly different requirements on the phase space of the 
lepton and then combine using a common phase space. 
We measure from the cross section in the muon chan- 
nel where W — >■ \iv in a visible phase space defined by 
p^ > 20 GeV, jTy'^l < 1.7 with at least one 6-jet limited 
to pp''^ > 20 GeV and lyy^-j'^'l < 1.1 as. 



a(W + b) ■ B(W ^ fiv) = 
1.04 ± 0.05 (stat.) ± 0.12 (syst.) pb. 



(2) 
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We perform an NLO QCD pred iction using MCFM v6.1, 
based on CTEQ6M PDF [ij] and a central scale of 
+ 2mh, where = 4.7 GeV is the mass of the 
b quark. Uncertainties are estimated by varying renor- 
malization and factorization scales by a factor of two in 
each direction, varying mi, between 4.2 and 5 GeV, and 
by using an alternative PDF set. The MCFM calculation 
predicts a{W + b) ■ B{W ^v) = 1.34 t° j° (scale) ± 
0.06 (PDF) 1°;°^ {rrib) pb. Predictions obtained us- 
ing SHERPA vl.4 and CTEQ6.6 PDFs ^ lead to a 
value 1.21 ±0.03 (stat.) pb. Using madgraphS [H] with 
CTEQ6L1 PDFs, we obtain 1.52±0.02 (stat.) pb. Uncer- 
tainties for scale variations, PDFs, and the 6-quark mass 
are on the order of about 30%. 

In the electron channel, we measure the cross sec- 
tion times branching fraction by selecting > 20 GeV, 
|77^| < 1.1 or 1.5 < 177*^1 < 2.5, at least one 6-jet as above 
and obtain 

a{W + b) ■ B{W ^ev) = 
1.00 ± 0.04 (stat.) ± 0.12 (syst.) pb. 

The MCFM calculated cross section for this channel 
is a{W + b) ■ B{W -> ev) = 1.28 ^0^:^ (scale) ± 
0.06 (PDF) 1°;°^ (mfc) pb. The SHERPA prediction is 
1.08 ± 0.03 (stat.) pb, while the madgraphS prediction 
is 1.44 ± 0.02 (stat.) pb. The combined systematic effect 
scale, PDF and variations is also around 30%. 

Using the MCFM prediction we extrapolate the mea- 
surement in the electron final state to the same selection 
requirements as the muon final state to allow for a con- 
sistent combination. Combining the results in — > j.w 
and W ^ ev decays we obtain 

ct{W H- b) ■ B(W ~^iv) = 
1.05 ± 0.03 (stat.) ± 0.12 (syst.) pb. ^ ' 

The small experimental uncertainty should allow to 
further constrain theoretical predictions. In summary, we 
have performed a measurement of the inclusive cross sec- 
tion for W boson production in association with at least 
one 5-jet at y/s = 1.96 TeV, considering final states with 
W ^ fjiv {W — > ev) events in a restricted phase space of 
> 20 GeV, |r;^| < 1.7 (1?^ | < 1.1 or 1.5 < \if \ < 2.5), 
with b jets limited to > 20 GeV and |r7''-j°*| < 1.1. 

The measured cross sections agree within uncertainties 
with NLO QCD calculations and predictions obtained 
using the SHERPA and madgraph generators. 
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